kyoto: Kyoto University Baseline at the NTCIR-11 MedNLP-2 Task
نویسندگان
چکیده
Since more electronic records are now used at medical scenes, the importance of technical development for analyzing such electronically provided information has been increasing significantly. This NTCIR-11 MedNLP-2 Task is designed to meet this situation. This task is a shared task that evaluates natural language processing technologies especially on Japanese medical texts. The task has three subtasks: (1) the Extraction task, which is to recognize complaints and diagnoses in medical texts; (2) the Normalization task, which is the ICD-coding task for complaint and diagnosis in the texts; (3) free task. This paper is the report on our results. For the Extraction task, we used a standard named entity recognition technique that is based on conditional random fields. For the normalization task, we used the string similarity between the input term and the MEDIS ICD-10 dictionary. For the free task, we proposed to design a glossary of medical terms for patients. The experimental results in the Extraction task showed reasonably high performance (precision: 77.10%, recall: 17.74%, F-measure: 58.97). However, the results in the Normalization task showed low performance (precision: 33.69%, recall: 33.69%, Fmeasure: 33.69). Finally, we show an example of the glossary described above as the result of the free task.
منابع مشابه
A NLP System of DCUMT in NTCIR-11 MedNLP-2: RNN for ICD/Time Entity Recognition and ICD Classification Tasks
This paper describes the medical NLP system developed at Dublin City University for participation in the Second Medical NLP Shared Task (MedNLP 2) in NTCIR-11 [1]. This shared task is a Japanese task. Our system detects International Classification of Diseases (ICD) and time entities and classifies ICD entities. We participated in the task 1 which detects the ICD and time entities, and the task...
متن کاملOverview of the NTCIR-11 MedNLP-2 Task
Electronic medical records are now often replacing paper documents, and thus the importance of information processing in medical fields has increased. We have already organized the NTCIR-10 MedNLP pilot task. It has been the very first shared task attempt to evaluate technologies to retrieve important information from medical reports written in Japanese, whereas the NTCIR-11 MedNLP-2 task has b...
متن کاملKYOTO at the NTCIR-12 Temporalia Task: Machine Learning Approach for Temporal Intent Disambiguation Subtask
This paper describes the Kyoto system for Temporal Intent Disambiguation (TID) subtask in the NTCIR-12 Temporal Information Access (Temporalia-2) challenge. The task is to estimate the distribution of temporal intents (Past, Recency, Future, Atemporal) of a given query. We took a supervised machine learning approach, using features of bag of words, POS and word vectors. We also incorporated kno...
متن کاملNTCIR-10 MedNLP Task Baseline System
Natural language processing (NLP) technology that handles clinical, medical and health records has been drawn much attention, because such kinds of records potentially could be rich clinical resources. This paper describes an NLP system that extracts two kinds of information from clinical documents in Japanese, which was developed as a baseline system in the NTCIR-10 MedNLP Pilot Task. Since ou...
متن کاملNCU IISR System for NTCIR-11 MedNLP-2 Task
This paper describes NCU IISR’s Japanese ICD-10 Code Linking system for NTCIR-11 MedNLP. Our system uses Conditional Random Fields (CRFs) to label ICD-10 mentions and temporal expressions. We also use CRFs to detect the modalities of the ICD-10 mentions. To resolve the problem of ICD-10 mention normalization, we use the Lucene engine to link mentions to the corresponding ICD-10 database entries...
متن کامل